PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Sopim08g076370.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 843 aa    MW: 93508.5 Da    pI: 4.9241
Description HD-ZIP family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Sopim08g076370.0.1  genome  CSHL    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  62.4   6.9e-20  105    160  1          56
                         TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                         +++ +++t  q++e+e+lF+++++p+ ++r +L++ lgL+ rqVk+WFqNrR+++k
  Sopim08g076370.0.1 105 KKRYHRHTVRQIQEMEALFKECPHPDDKQRLKLSQDLGLKPRQVKFWFQNRRTQMK 160
                         688899***********************************************998 PP

2    START     152.2  4.4e-48  307    534  1          206
                         HHHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS..............SCEEEEEEEECCSCHHHHHHHHHCCCGG...CT-TT CS
               START   1 elaeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv.............dsgealrasgvvdmvlallveellddke...qWdet 75 
                         ela++ ++elvk+ ++++p+W + s ++ g+evl  +e s+               ++ea r s+vv+m++ +lv  +ld+++    +   
  Sopim08g076370.0.1 307 ELALSSMDELVKMCTSSDPLWIRAS-NDSGKEVLNVEEYSRMfpwpvgvkqngneLKIEATRSSAVVIMNSITLVDAFLDTNKcieLFPSI 396
                         578999*******************.77777777777766667777889*9*******************************999999999 PP

                         -SEEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEEC CS
               START  76 lakaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppesssvvRaellpSgiliepks 158
                         + +a+t++v +sg      g lqlm++e+q+l+plv+ R+ +f+Ry++q  ++g+w+ivd  +ds  ++   +++   +++pSg++i++++
  Sopim08g076370.0.1 397 ISRAKTIQVATSGvsghasGSLQLMFMEMQVLTPLVStRECYFLRYCQQnVEEGSWAIVDFPLDSLHNNF-PPPFPYFKRRPSGCIIQDMP 486
                         99***********************************************99***********99988877.57777777************ PP

                         TCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
               START 159 nghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                         ng+s+vtwveh++++++ + ++++++v sg+a+ga++w++ lqrqce+
  Sopim08g076370.0.1 487 NGYSRVTWVEHAEVEENPVNQIFNHFVTSGVAFGAQRWLSILQRQCER 534
                         **********************************************97 PP

Protein Features
3D Structure
Database         Entry ID          E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  5.1E-20    91     163  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.59E-18   94     163  IPR009057    Homeodomain-like
PROSITE profile  PS50071           16.682     102    162  IPR001356    Homeobox domain
SMART            SM00389           6.3E-19    103    166  IPR001356    Homeobox domain
Pfam             PF00046           2.1E-17    105    160  IPR001356    Homeobox domain
CDD              cd00086           7.69E-18   105    163  No hit       No description
PROSITE pattern  PS00027           0          137    160  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           40.874     298    537  IPR002913    START domain
SuperFamily      SSF55961          8.65E-33   299    536  No hit       No description
CDD              cd08875           1.05E-111  302    533  No hit       No description
SMART            SM00234           2.1E-30    307    534  IPR002913    START domain
Pfam             PF01852           4.6E-40    307    534  IPR002913    START domain
SuperFamily      SSF55961          2.06E-18   561    730  No hit       No description
SuperFamily      SSF55961          2.06E-18   767    807  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0048497  Biological Process  maintenance of floral organ identity
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 843 aa
MFGDCQLFSS MGMGGNNNNN NNVSSDTLYS SSIQNPNFNF MTMGGNNLPF NIFPPNNIIP  60
KEENGLFKNK EEMDSGSGSE HIEGMSGNEL EPEQQQQQQQ QGGKKKRYHR HTVRQIQEME  120
ALFKECPHPD DKQRLKLSQD LGLKPRQVKF WFQNRRTQMK AQQDRSDNVI LRAENDNLKN  180
ENYRLQAALR SIMCPTCGGP AMLGEMGYDE QQLRLENARL KEEFERVCCL VSQYNGRGPM  240
QGLGPPNPLL PPSLELDMSI NNFTSKFEDQ PDCADMVPVP LLMPDQNNSQ FSGGPMILEE  300
EKSLAMELAL SSMDELVKMC TSSDPLWIRA SNDSGKEVLN VEEYSRMFPW PVGVKQNGNE  360
LKIEATRSSA VVIMNSITLV DAFLDTNKCI ELFPSIISRA KTIQVATSGV SGHASGSLQL  420
MFMEMQVLTP LVSTRECYFL RYCQQNVEEG SWAIVDFPLD SLHNNFPPPF PYFKRRPSGC  480
IIQDMPNGYS RVTWVEHAEV EENPVNQIFN HFVTSGVAFG AQRWLSILQR QCERLASLMA  540
RNISDLGVIP SPEARKSLMN LAQRMIKTFC MNISTCCGQS WTALSDSPDD TVRITTRKVT  600
EPGQPNGLIL SAVSTSWLPY NHFQVFDLLR DERRRAQLDV LSNGNSLHEV AHIANGSHPG  660
NCISLLRINV ASNSSQSVEL MLQESCTDDS GSLVVYTTVD VDAIQLAMNG EDPSCIPLLP  720
LGFVITPINN GQVNMNNSDN NVSGTEANSS QSSEKRQNLS SIQEYSGGCL LTVGLQVLAS  780
TIPSAKLNLS SVTAINHHLC NTVQQINAAL VAFYPDTEIT APSSPPPQQP KSSKQADENS  840
NS*
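The domain coordinates in the Signature Domain table are 1-based and inclusive, so they can be checked against the sequence block above. The following is a minimal sketch, not part of the PlantTFDB record: the helper names clean_sequence and domain_slice are hypothetical, and only the first three lines of the sequence (residues 1-180) are embedded to keep the example short.

```python
import re

# Excerpt of the protein sequence block from this record (residues 1-180).
# The full record has 843 residues; the excerpt is enough for the Homeobox domain.
SEQUENCE_BLOCK = """\
MFGDCQLFSS MGMGGNNNNN NNVSSDTLYS SSIQNPNFNF MTMGGNNLPF NIFPPNNIIP  60
KEENGLFKNK EEMDSGSGSE HIEGMSGNEL EPEQQQQQQQ QGGKKKRYHR HTVRQIQEME  120
ALFKECPHPD DKQRLKLSQD LGLKPRQVKF WFQNRRTQMK AQQDRSDNVI LRAENDNLKN  180
"""


def clean_sequence(block: str) -> str:
    """Hypothetical helper: drop spaces, position counters, and any '*' stop marker."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Hypothetical helper: return residues start..end (1-based, inclusive)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"coordinates {start}-{end} fall outside 1-{len(sequence)}")
    return sequence[start - 1:end]


sequence = clean_sequence(SEQUENCE_BLOCK)

# Homeobox domain coordinates taken from the Signature Domain table (105-160).
homeobox = domain_slice(sequence, 105, 160)

print(len(sequence))  # residues in the excerpt
print(homeobox)       # should match the query row of the Homeobox alignment
```

With the excerpt above, the slice begins with KKRYHR and ends with RRTQMK, the same 56 residues shown on the Sopim08g076370.0.1 row of the Homeobox alignment. Pasting the full sequence block into SEQUENCE_BLOCK would allow the START domain (307-534) to be checked the same way.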
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AP009593  0.0      AP009593.1 Solanum lycopersicum DNA, chromosome 8, clone: C08HBa0018O15, complete sequence.
GenBank  HG975520  0.0      HG975520.1 Solanum lycopersicum chromosome ch08, complete genome.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_004245919.1      0.0      PREDICTED: homeobox-leucine zipper protein HDG5 isoform X1
Refseq     XP_010325574.1      0.0      PREDICTED: homeobox-leucine zipper protein HDG5 isoform X1
Swissprot  A2ZAI7              0.0      ROC3_ORYSI; Homeobox-leucine zipper protein ROC3
TrEMBL     K4CN00              0.0      K4CN00_SOLLC; Uncharacterized protein
STRING     Solyc08g076370.2.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA90202226
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G46880.1  0.0      homeobox-7